Simulating quantum computation by contracting tensor networks 
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Abstract 

The treewidth of a graph is a useful combinatorial measure of how close the graph is to a tree. We prove that 
a quantum circuit with T gates whose underlying graph has treewidth d can be simulated deterministically in 
r^('' exp[C?(J)] time, which, in particular, is polynomial in 7 if J = O(logr). Among many implications, 
we show efficient simulations for log-depth circuits whose gates apply to nearby qubits only, a natural 
constraint satisfied by most physical implementations. We also show that one-way quantum computation of 
Raussendorf and Briegel {Physical Review Letters, 86:5188-5191, 2001), a universal quantum computation 
scheme with promising physical implementations, can be efficiently simulated by a randomized algorithm 
if its quantum resource is derived from a small-treewidth graph. 
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1 Introduction 



The recent interest in quantum circuits is motivated by several complementary considerations. Quantum 
information processing is rapidly becoming a reality as it allows manipulating matter at unprecedented scale. 
Such manipulations may create particular entangled states or implement specific quantum evolutions — they 
find uses in atomic clocks, ultra-precise metrology, high-resolution lithography, optical communication, etc. 
On the other hand, engineers traditionally simulate new designs before implementing them. Such simulation 
may identify subtle design flaws and save both costs and effort. It typically uses well-understood host 
hardware, e.g., one can simulate a quantum circuit on a commonly-used conventional computer. 

More ambitiously, quantum circuits compete with conventional computing and communication. Quantum- 
mechanical effects may potentially lead to computational speed-ups, more secure or more efficient commu- 
nication, better keeping of secrets, etc. To this end, one seeks new circuits and algorithms with revolutionary 
behavior as in Shor's work on number-factoring, or provable limits on possible behaviors. While proving 
abstract limitations on the success of unknown algorithms appears more difficult, a common line of rea- 
soning for such results is based on simulation. For example, if the behavior of a quantum circuit can be 
faithfully simulated on a conventional computer, then the possible speed-up achieved by the quantum circuit 
is limited by the cost of simulation. Thus, aside from sanity-checking new designs for quantum information- 
processing hardware, more efficient simulation can lead to sharper bounds on all possible algorithms. 

Since the outcome of a quantum computation is probabilistic, we shall clarify our notion of simulation. 
By a randomized simulation, we mean a classical randomized algorithm whose output distribution on an 
input is identical to that of the simulated quantum computation. By a deterministic simulation, we mean a 
classical deterministic algorithm which, on a given pair of input x and output y of the quantum computation, 
outputs the probability that y is observed at the end of the quantum computation on x. 

To simulate a quantum circuit, one may use a naive brute-force calculation of quantum amplitudes that 
has exponential overhead. Achieving significantly smaller overhead in the generic case appears hopeless 
— in fact, this observation lead Feynman to suggest that quantum computers may outperform conventional 
ones in some tasks. Therefore, only certain restricted classes of quantum circuits were studied in existing 
literature on simulation. 

Classes of quantum circuits that admit efficient simulation are often distinguished by a restricted "gate 
library", but do not impose additional restrictions on how gates are interconnected or sequenced. A case in 
point is the seminal Gottesman-Knill Theorem |[T3l and its recent improvement by Aaronson and Gottes- 
man Oj. These results apply only to circuits with stabilizer gates — Controlled-NOT, Hadamard, Phase, 
and single-qubit measurements in the so called Clifford group. Another example is given by match gates 
defined and studied by Valiant |[34l . and extended by Terhal and DiVincenzo |[32l . 

A different way to impose a restriction on a class of quantum circuits is to limit the amount of entan- 
glement in intermediate states. Jozsa and Linden tl7] . as well as Vidal lt37| demonstrate efficient classical 
simulation of such circuits and conclude that achieving quantum speed-ups requires more than a bounded 
amount of entanglement. 

In this work we pursue a different approach to efficient simulation and allow the use of arbitrary gates. 
More specifically, we assume a general quantum circuit model in which a gate is a general quantum operation 
(so called physically realizable operators) on a constant number of qubits. This model, proposed and studied 
by Aharonov, Kitaev and Nisan |f2|, generalizes the standard quantum circuit model, defined by Yao BTl . 
where each gate is unitary and measurements are applied at the end of the computation. We also assume 
that (i) the computation starts with a fixed unentangled state in the computational basis, and (ii) at the end 
each qubit is either measured or traced-out. 

Our simulation builds upon the framework of tensor network contraction. Being a direct generaUzation of 
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matrices, tensors capture a wide range of linear phenomena including vectors, operators, multi-lineai^ forms, 
etc. They facilitate convenient and fundamental mathematical tools in many branches of physics such as 
fluid and solid mechanics, and general relativity ifTSl . More recently, several methods have been developed 
to simulate quantum evolution by contracting variants of tensor networks, under the names of Matrix Prod- 
uct States (MPS), Projected Entangled Pairs States (PEPS), etc (371 |38l IMl IH |35l |36l IHl- Under this 
framework, a quantum circuit is regarded as a network of tensors. The simulation contracts edges one by 
one and performs the convolution of the corresponding tensors, until there is only one vertex left. Having 
degree 0, this vertex must be labeled by a single number, which gives the final measurement probability 
sought by simulation. In contrast with other simulation techniques, we do not necessarily simulate individ- 
ual gates in their original order — in fact, a given gate may even be simulated partially at several stages of 
the simulation. 

While tensor network contraction has been used in previous work, little was known about optimal con- 
traction orders. We prove that the minimal cost of contraction is determined by the treewidth tw (Gc) of the 
circuit graph Gc- Moreover, existing constructions that approximate optimal tree-decompositions (e.g. |!29l) 
produce near-optimal contraction sequences. We shall define the concepts of treewidth and tree decom- 
positions in Section 2. Intuitively, the smaller a graph's treewidth is, the closer it is to a tree, and a tree 
decomposition is a drawing of the graph to make it look like a tree as much as possible. Our result al- 
lows us to leverage the extensive graph-theoretical literature dealing with the properties and computation of 
treewidth. 

Theorem 1.1. Let C be a quantum circuit with T gates and whose underlying circuit graph is Gc- Then C 
can be simulated detenninistically in time T'^'-'' exp[C?(tw(Gc))]. 

A rigorous restatement of the above theorem is Theorem 14.61 By this theorem, given a function com- 
putable in polynomial time by a quantum algorithm but not classically, any polynomial-size quantum circuit 
computing the function must have super-logaiithmic treewidth. 

The following corollary is an immediate consequence. 

Corollary 1.2. Any polynomial-size quantum circuit of a logarithmic treewidth can be simulated detennin- 
istically in polynomial time. 

Quantum formulas defined and studied by Yao ll4ll are quantum circuits whose underlying graphs are 
trees. Roychowdhury and Vatan |[3TI showed that quantum formulas can be efficiently simulated deter- 
ministically. Since every quantum formula has treewidth 1, Corollary 11.21 gives an alternative efficient 
simulation. 

Our focus on the topology of the quantum circuit allows us to accommodate arbitrary gates, as long as 
their qubit- width (number of inputs) is limited by a constant. In particular. Corollary 11.21 implies efficient 
simulation of some circuits that create the maximum amount of entanglement in a partition of the qubits, 
e.g., a layer of two-qubit gates. Therefore, our results are not implied by previously published techniques. 

We now articulate some implications of our main result to classes of quantum circuits, in terms of prop- 
erties of their underlying graphs. The following two classes of graphs are well-studied, and their treewidths 
are known. The class of series parallel graphs arises in electric circuits, and such circuits have treewidth 
< 2. Planar graphs G with n vertices are known to have treewidth tw(G) = 0{y^\V{G)\) ll4l . 

Corollary 1.3. Any polynomial size parallel serial quantum circuit can be simulated detenninistically in 
polynomial time- 

CoroUary 1.4. A size T planar quantum circuit can be simulated deterministically in exp[0(\/r)] time- 
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Another corollary deals with a topological restriction representative of many physical realizations of 
quantum circuits. Let ^ > 1 be an integer. A circuit is said to be <7-local-interacting if under a linear 
ordering of its qubits, each gate acts only on qubits that are at most q distance apart. A circuit is said to 
be local-interacting if it is ^-local interacting with a constant q independent of the circuit size. Such local- 
interaction circuits generalize the restriction of qubit couplings to nearest-neighbor qubits (e.g., in a spin- 
chain) commonly appearing in proposals for building quantum computers, where qubits may be stationary 
and cannot be coupled arbitrarily. To this end, we observe that the treewidth of any local-interaction circuit 
of logarithmic depth is at most logarithmic. 

Corollary 1.5. Let C be a quantum circuit of size T and depth D, and is q-local-interacting. Then C 
can be simulated deterministically in T'^^^^ exp[0(qD)\ time. In particular, ifC is a polynomial-size local- 
interacting circuit with a logarithmic depth, then it can be simulated deterministically in polynomial time. 

Yet another important application of our approach is to the simulation of one-way quantum computation. 
In two influential papers 1711261, Briegel and Raussendorf introduced the concept of graph states — quantum 
states derived from graphs, — and show that an arbitrary quantum circuit can be simulated by adaptive, 
single-qubit measurements on the graph state derived from the grid graph. Note that the graph state for 
a one-way quantum computation does not depend on the quantum circuit to be simulated (except that its 
size should be large enough) and that for most physical implementations single-qubit measurements are 
much easier to implement than multi-qubit operations. Hence it is conceivable that graph states would be 
manufactured by a technologically more advanced party, then used by other parties with lesser quantum- 
computational power in order to facilitate universal quantum computing. This makes one-way quantum 
computation an attractive scheme for physical implementations of universal quantum computation. An 
experimental demonstration of one-way quantum computation appeared in a recent Nature article [39 ]. 

A natural question about one-way computation is to characterize the class of graphs whose graph states 
are universal for quantum computation. We call a family of quantum states (j) = {|(j)i), |(|)2), • • • , | (])„),••• } 
universal for one-way quantum computation if (a) the number of qubits in is bounded by a fixed poly- 
nomial in n; (b) any quantum circuit of size n can be simulated by a one-way quantum computation on |(j)„). 
On the other hand, (|) is said to be efficiently simulatable if any one-way quantum computation on |(|)„) can 
be efficiently simulated classically for all sufficiently large n. Note that the class of universal families and 
that of efficiently simulatable families are disjoint if and only if efficient quantum computation is indeed 
strictly more powerful than efficient classical computation. We show that it is necessary for graphs to have 
high treewidth so that the corresponding graph states are not efficiently simulatable. 

Theorem 1.6. Let G be a simple undirected graph. Then a one-way quantum computation on the respective 
graph state can be simulated by a randomized algorithm in time |V(G)|^*^'^ exp[C?(tw(G))]. 

Our simulation can be made deterministic with a better upper bound on time complexity if the one-way 
computation satisfies additional constraints, such as those in |[26l . We shall elaborate on this improvement 
in Section [6l 

An important limitation of our techniques is that a circuit family with sufficiently fast-growing treewidth 
may require super-polynomial resources for simulation. In particular, this seems to be the case with known 
circuits for modular exponentiation. Therefore, there is little hope to efficiently simulate number-factoring 
algorithms using tree decompositions. As an extreme example to illustrate the limitation of our technique, 
we give a depth-4 circuit — including the final measurement as the 4th layer — that has large treewidth. 

Theorem 1.7. There exists a depth-4 quantum circuit on n qubits using only one- and two-qubit gates such 
that its treewidth is Q.{n). 
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Note that a circuit satisfying the assumption in the above theorem must have 0{n) size. Our construction 
is based on expander graphs, whose treewidth must be linear in the number of vertices (Lemma [S!2l) . 

This finding is consistent with the obstacles to efficient simulation that are evident in the results of Terhal 
and DiVincenzo |[33l . later extended by Fenner et al. ||T4||. In contrast, we are able to efficiently simulate 
any depth-3 circuit deterministically while the simulation in |[33l is probabilistic. 

Theorem 1.8. Assuming that only one- and two-qubit gates are allowed, any polynomial- size depth-?) quan- 
tum circuit can be simulated deterministically in polynomial time. 

Our simulation algorithm is related to algorithms for other tasks in that its runtime depends on the 
treewidth of a graph derived from the input. Bodlaender wrote an excellent survey HI on this subject. 
Particularly relevant are algorithms based on "vertex eliminations", e.g., the Bucket Elimination algorithm 
for Bayesian Inference ifTTl . Another parallel can be made with the work by Broering and Lokam flOl . 
which solves Circuit-SAT in time exponential in the treewidth of the graph of the given circuit. However, to 
our best knowledge, we are the first to relate the treewidth of a quantum circuit to its classical simulation. 

Our results are applicable to the simulation of classical probabilistic circuits, which can be modeled by 
matrices, similarly to quantum circuits. Such simulation has recently gained prominence in the literature 
on the reliability of digital logic [18], and is particularly relevant to satellite-based and airborne electronics 
which experience unpredictable particle strikes at higher rates. 

The rest of the paper is organized as follows. After introducing notation, we describe how quantum 
circuits and their simulation can be modeled by tensor networks. The runtime of such simulation depends 
on the graph parameter that we call the contraction complexity. We then relate the contraction complexity 
to treewidth, and apply the simulation to restricted classes of graphs, and to one-way quantum computa- 
tion. Finally, we discuss possible directions for future investigations with a brief survey on the subsequent 
development since the announcement of our results. 

2 Notation and definitions 

def 

For integer « > 1, define [n] = { 1 , 2, . . . , n}. An ordering 7i of an ^-element set is denoted by 7i(l), 7i(2), . . ., 
%{n). Unless otherwise stated, graphs in this paper are undirected and may have multiple edges or loops. 
Edges connecting the same pair of vertices are called parallel edges. If G is a graph, its vertex set is denoted 
by V{G) and its edge set by E{G). When it is cleai^ in the context, we use V = V{G) and E = E{G). The 
degree of a vertex v, denoted by d{v), is the number of edges incident to it. In particular, a loop counts as 1 
edge. The maximum degree of a vertex in G is denoted by A(G). 

Treewidth of a graph. Let G be a graph. A tree decomposition of G |[28l is a tree T , together with a 
function that maps each vertex w G V(T) to a subset B^. C V{G). These subsets B„. are called bags (of 
vertices). In addition, the following conditions must hold. 

(Tl) V}vev(T)^v — ^(^)' i-^-' ^^'^h vertex must appear in at least one bag. 

(T2) V {m,v} G E{G), 3w G V(T), {m,v} C B„, i.e., for each edge, at least one bag must contain both of its 
end vertices. 

(T3) V M G y{G), the set of vertices w G V(T ) with u G B„ form a connected subtree, i.e., all bags containing 
a given vertex must be connected in T . 
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Figure 1: A graph and its decomposition of width 2 with 6 bags. 

The width of a tree decomposition is defined by maXn,£y(,2-) \B„\ — 1. The treewidth of G is the minimum 
width over its tree decompositions. For example, all trees have treewidth 1 and single cycles of length at 
least 3 have treewidth 2. Figure [T] shows an example of tree decomposition. Intuitively, a tree decomposition 
T is a way of drawing a graph to look like a tree, which may require viewing sets of vertices (bags) as single 
vertices. The less a graph looks like a tree, the larger the bags become. The notion of tree decomposition has 
been useful in capturing the complexity of constraint satisfaction problems, Bayesian networks and other 
combinatorial phenomena represented by graphs. In further writing, we may refer to a vertex in T by its 
bag when the context is clear. 

Treewidth can be defined in several seemingly unrelated ways, e.g., as the minimum k for which a given 
graph is a partial k-tree, as the induced width (also called the dimension), or as the elimination width BOI ISl. 
An elimination ordering 71 of a graph G is an ordering of V{G). The induced width of a vertex v G V{G) 
in the ordering is the number of its neighbors at the time it is being removed in the following process: start 
with 7i(l), add an edge for each pair of its neighbors that were previously not adjacent, remove 7i(l), then 
repeat this procedure with the next vertex in the ordering. The width ofn is the maximum induced width of 
a vertex, and the induced width ofG is the minimum width of an elimination ordering. It is known that the 
induced width of a graph is precisely its treewidth [5]. 

It follows straightforwardly from the definition of treewidth that if G is obtained from G' by removing a 
degree 1 vertex, tw(G) = tw(G'), unless G' has only 1 edge, in which case tw(G) = and tw(G') = 1. We 
will also use the following well known and simple fact, a proof for which is provided in the Appendix. 

Proposition 2.1. Let G be a simple undirected graph, and w be a degree 1 vertex. Then removing w and 
connecting its two adjacent vertices does not change the treewidth. 

Quantum circuits. We review some basic concepts of quantum mechanics and quantum computation. For 
a more detailed treatment, we refer the readers to the book by Nielsen and Chuang |[24l . 

The state space of one qubit is denoted by 9i C^. We fix an orthonormal basis for H and label the 
basis vectors with |0) and 1 1). The space of operators on a vector space V is denoted by L(V). The identity 
operator on V is denoted by ly, or by / if V is implicit from the context. A density operator, or a mixed 
state, of n qubits is a positive semi-definite operator p G L(:?/®") with tracep = 1. For a binary string 

def def 

X = x\X2 ■■ - Xn G {0, 1}", let Pv = be the density operator of the state \x) = ®'J^\\xi). 

In this paper, a quantum gate with a input qubits and b output qubits is a superoperator 2 : L(i?/'®") ^ 
L(^®''). There are certain constraints that Q must satisfy in order to represent a physically realizable 
quantum operation. We need not be concerned about those constraints as our simulation method does not 
depend on them. In existing applications one typically has a>b and often a = b, though a density operator 
can also be regarded as a gate with a = 0. The ordering of inputs and outputs is in general significant. If 
2 is a traced out operator, then b = 0, and 2(|jc)(y|) = {x\y), for all x,y G {0, 1}". We denote by Q[A] the 
application of Q to an ordered set A of a qubits. 
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The information in a quantum state is retrieved through the application of measurements. A POVM 
(Positive Operator- Valued Measure) 5W on n qubits is a set 5Vf = {Mi ,M2, • • • ,M<.}, where each M, is called 
a POVM element, and is a positive semi-definite operator in L(:tf ®") such that Y!i=i = ^- The single-qubit 
measurement in the computational basis is {|0)(0|, |1)(1|}. 

We assume that the maximum number of qubits on which a quantum gate can act is bounded by a constant 
(often two or three). A quantum circuit of size T with n input-qubits and m output-qubits consists of the 
following: 

(1) A sequence of n input-wires, each of which represents one input-qubit, i.e., a qubit which is not the 
output qubit of any gate. 

(2) A sequence of T quantum gates gi, g2, ■ ■ ., gr, each of which is applied to some subset of the wires. 

(3) A sequence of m output-wires, each of which represents an output-qubit, i.e., a qubit which is not the 
input qubit of any gate. 

Note that by the above definition, a quantum circuit C defines a function C : L(:tf ®") L(:?/®"'). In most 

def 

applications, a circuit C is applied to an input state = ®'i=i\xi){xi\, for some binary string x = xi ■ ■ - x^ G 
{0, 1}", and at the end of the computation, measurements in the computational basis are applied to a subset 
of the qubits. We shall restrict our discussions to such case, though our results can be extended to more 
general cases. 

The graph of a quantum circuit C, denoted by Gc, is obtained from C as follows. Regard each gate as a 
vertex, and for each input/output wire add a new vertex to the open edge of the wirejl Each wire segment 
can now be represented by an edge in the graph. 

3 Tensors and tensor networks 

Tensors, commonly used in physics, are multi-dimensional matrices that generalize more traditional tools 
from linear algebra, such as matrix products. Here we focus on features of tensors that are relevant to our 
work. 

Definition 3.1. A rank-^ tensor in an m-dimension space g = [gi,j2,--Jk]ii-i2,--Jk wi'^-dimensional array 
of complex numbers g,!,,^,...,^, indexed by k indices, i\, (2, ■ ■ ., 4> each of which takes m values. When the 
indices are cleai^ we omit them outside the bracket. 

For example, a rank-0 tensor is simply a complex number, and a rank-1 tensor is a dimension-m complex 

def 

vector. We focus on dimension-4 tensors, and set the range of each index to be IT = {|fti)(Zj2| : ^1,^2 £ 
{0, 1}}. We fix the following tensor representation of a density operator and a superoperator. 

Definition 3.2. Let p be a density operator on a qubits. The tensor of p is [Pai,a2,- -.aa]ai,a2 •■,Oflen> where 

Pa,,...,a„ =^Kp•(®tl<JO'^)• 
Let 2 be a superoperator acting on a input qubits and b output qubits. The tensor of Q is 

2ai,C2,---,o<,,i:i,T:2,---,i:i,]oi,...,0a,i:i,...,Tien, 

where 

Ga,,a2,-,a„,T,,T2,-,T, = tr{Q{®U<^i) ■ (®5=iXy)^). 
■'These vertices are going to represent input states, as well as measurements and trace-out operators at the end of the computation. 
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We shall use the same notation for a density operator (or a superoperator) and its tensor. We now define 
the central object of the paper. 

Definition 3.3. A tensor network is a collection tensors, each index of which may be used by either one or 
two tensors. 

A rank-^ tensor g can be graphically represented as a vertex labeled with g, and connected to k open wires, 
each of which is labeled with a distinct index. We may represent a tensor network by starting with such 
graphical representations of its tensors, and then connecting wires corresponding to the same index. Note 
that now each wire corresponds to a distinct index. Also, an index that appears in one tensor corresponds to 
an open wire, and an index that appears in two tensors corresponds to an edge connecting two vertices. Parts 
(a) and (b) in Figure [2] give an example of the graphical representation of a tensor and a tensor network. In 
the tensor gQ, we call the a, wires, 1 < / < a, input wires, and the Xj wires, I < j <b, the output wires. 




(a) (b) (c) (d) 



Figure 2: A rank-4 tensor is illustrated in (a), and a tensor network with four tensors is shown in (b). 
Contraction of two tensors is illustrated in (c) and (d). 

Suppose in a tensor network, there are I parallel edges i\ Jj, ■ ■■, h between two vertices g = [g/i ....,/, ,;, y^] 

and h = [/j;;,...,,-, j'|,...j-^J. We may contract those edges by first removing them, then merging and v/, into a 
new vertex v/, whose tensor is / = [/;,,. ...y^.y; y^,], and 

fju...,jk,j\,...,j[, = 22 Sii,...,itJ,,....jk ■ \,...,ic,j\,...,j[r (1) 

'1 ,'2, 

Parts (c) and (d) in Figure [2]illustrate the above contraction. Note that a tensor network with k open wires 
can be contracted to a single tensor of rank k, and the result does not depend on the order of contractions. 
The following example is instructive. 

Example 1. Let p be an a-qubit density operator and 2 be a superoperator with a input qubits and b output 
qubits. Consider the tensor network that connects all wires of the tensor p to the input wires of the tensor Q. 
Then contracting this tensor network gives the tensor of the density operator 2(p). Figure |3] illustrates this 
example. 

A quantum circuit C can be naturally regarded as a tensor network N{C): each gate is regarded as the 
corresponding tensor. The qubit lines are wires connecting the tensors, or open wires that correspond to the 
input and output qubits. Figured illustrates the concept. 

Let C be a quantum circuit with n input qubits and m output qubits. Suppose that C is applied to the initial 
state Px, for some x G {0, 1}", and we are interested in knowing the probability of observing some particular 
outcome when some single-qubit measurements are applied to a subset of the qubits. The setting can be 
described by a measurement scenario defined as follows. 

Definition 3.4. Let m > 1 be an integer. A measurement scenario on m qubits is a function x : [m] L(C^), 
such that x(/) is a single-qubit POVM measurement element. 
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Figure 3: Contracting the wires connecting the tensors for a density operator p and a gate Q results in the 
tensor for Q{p). 



Note that if a qubit / is not to be measured, we can set x(/) = /. 

To compute the probability that x is realized on C(p.v), we build a tensor network N{C;x,x) from N{C) 
by attaching to each input open wire / the tensor for and attaching to each open wire for the output 

qubit / the tensor for t(/). When x = 0", we abbreviate N{C;x,x) as N{C;x). Figured illustrates the concept 
of N{C) andAf(C;x). 



|0>(0| 



|0>(0| 





c 



c 



O 

trace((8);ii t{i) C(10)(01®")) 



N{C) 



O O _ - 

t(1) r(2) r(3) r(4) 
N{C;t) 



(a) 



(b) 



Figure 4: In (a), a circuit C can be naturally regarded as a tensor network N{C). Contracting N{C) gives 
the tensor for the operator that C realizes. Part (b) illustrates the tensor network N{C;'z), contracting which 
gives the rank-0 tensor whose value is precisely the probability that the measurement scenario X is reaUzed 
onC(|0)(0|®"). 



Proposition 3.5. Let C be a quantum circuit, x be a binary string, and 1 be a measurement scenario. 
Contracting the tensor network N{C;x,z) to a single vertex gives the rank-0 tensor which is the probability 
that X is realized on C{px)- 

Proof. Let '= gtgt-\ ■■■gi {Px), 1 < ? < and p" = p^. By the definitions of tensors for density operators 
and superoperators and tensor contraction, contracting wires connecting the tensor of a superoperator Q and 
the tensors for a density operator p gives the tensor of Q{p). Thus sequentially contracting input wires of 
§i> gives the tensor for p', and contracting the remaining wires gives the tensor for x(p^), which is 

the probability of realizing x on p^ = C{px). □ 

We remark that N{C;x,x) is not the only tensor network for which the above Proposition holds. 
Although the ordering of the edges in the contraction process does not affect the final tensor, it may 
significantly affect space and time requirements. 
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Proposition 3.6. Given a tensor network N of a size T quantum circuit, and a contraction process specified 
by an ordering of wires in N, let d be the maximum rank of all the tensors that appear in the process. Then 
the contraction takes C?(rexp[C?((i)]) time. 

Proof. Note that the size of A'^ is ®{T). The algorithm stores the tensors of each vertex. When contracting 
an edge, it computes the new tensor according to Equation [H and updates the tensor accordingly. This takes 
exp[C?((i)] time. Hence the total runtime is 0{T &xp[0{d)]). □ 

In the next Section we will investigate near-optimal orderings for simulation and ways to find them. 
While traditional simulation of quantum circuits proceeds in the same order in which the gates are applied, 
it appears that an optimal ordering may not have any physical meaning. Therefore, we formalize this opti- 
mization using abstract graph contractions. 

4 Contraction complexity and treewidth 

Let G be a graph with vertex set V{G) and edge set ^(G). Recall that the contraction process discussed in 
the previous Section removes parallel edges in one step because contracting one edge at a time can create 
multiple loops. However, for future convenience we prefer the latter simulation and therefore allow loops to 
remain not contracted, counting toward the degree of a vertex. Note that if a "parallel" contraction contracts 
I edges between two vertices u and v of degrees l + k and l-\-k' , respectively, the corresponding "one-edge- 
at-a-time" conti'action would create vertices of degrees k + k' + l—\,k + k' + 1 — 2, ■ ■ ■ ,k + k' , each of which 
is < d{u) +d{v). Thus the one-edge-at-a-time contraction process can emulate the parallel contraction, 
while increasing the maximum vertex degree observed by no more than two-fold. We make the definition of 
this new contraction process precise below. 

Definition 4.1. The contraction of an edge e removes e and replaces its end vertices (or vertex) with a 
single vertex. A contraction ordering n is an ordering of all the edges of G, 7i(l), n{2), . . ., n{\E{G)\). The 
complexity of n is the maximum degree of a merged vertex during the contraction process. The contraction 
complexity of G, denoted by cc(G), is the minimum complexity of a contraction ordering. 

Since only the degrees of the merged vertices are considered in defining the contraction complexity, cc(G) 
could be strictly larger than A(G). For example, if G is a path, cc(G) = 1 and A(G) = 2. 

Note that sequentially contracting all 7i(/), 1 < / < [^(G)!, reduces G to a single vertex (or an empty graph 
of several vertices). Also, for any graph G, cc(G) < |£'(G)| — 1, since any merged vertex would be incident 
to no more than |£'(G)| — 1 number of edges. Furthermore, cc(G) > A(G) — 1, since when an edge incident 
to a vertex of degree A(G) is removed, the resulting merged vertex is incident to at least A(G) — 1 edges. 

The nature of cc(G) becomes clearer once we consider the line graph of G, denoted by G*. That is, the 

def 

vertex set of G* is V{G* ) = E{G), and the edge set is 

def 

E{G*) = {{^1,^2} ^ E{G) : e\ 7^ e2, 3v G V(G) such that e\ and e2 are both incident to v}. 

Proposition 4.2. For any graph G = (y^E), cc(G) = tw(G*). Furthermore, given a tree decomposition of 
G* of width d, there is a deterministic algorithm that outputs a contraction ordering 71 with cc(7l) < d in 
polynomial time. 

Computing the treewidth of an arbitrary graph is NP-hard lH, but we do not know if this remains true 
for the special class of graphs G*. Nevertheless, this is not critical in our work since the constant-factor 
approximation due to Robertson and Seymour ll29l suffices for us to prove our key results. 
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Theorem 4.3 (Robertson and Seymour |[29l ). There is a deterministic algorithm that given a graph G 
outputs a tree decomposition ofG of width C?(tw(G)) in time 1^(0)1*^^^^ exp[C?(tw(G))]. 

Proof of Proposition \4. 21 There is a one-to-one correspondence of the contraction of an edge in G and the 
eUmination of a vertex in G* , and the degree of the merged vertex resulting from contracting an edge e in G 
is the same as the degree of e being eliminated in G*. Thus cc(G) = tw(G*). 

To prove the second part of the statement, denote the tree decomposition by T . Repeat the following 
until the tree decomposition becomes an empty graph. Choose a leaf i inT . If £ is the single vertex of T , 
output vertices (of G*) in Bi in any order. Otherwise, let i' be its parent. If B( C B(i, remove i and repeat 
this process. Otherwise, let e ^B^ — B(i. Output e, remove it from the tree decomposition and continue the 
process, until all vertices of the tree decomposition are removed. The number of steps in this process is 
polynomial in the size of the tree decomposition. 

Note that each output e appears in only one bag in the tree decomposition. Therefore, all (current) 
neighbors of e must appear in the same bag. Hence its induced width is at most d. By the one-to-one 
con^espondence of the vertex elimination in G* and the contraction process in G, cc{%) <d. □ 

Before we complete the description of our simulation algorithm, we relate the treewidth of G to that of 
G*. This is useful for reasoning about quantum circuits C when the graph Gc is easier to analyze than 
its line graph G^. In such cases one hopes to bound the runtime of the simulation algorithm in terms of 
parameters of G rather than G* . Fortunately, since Gc is of bounded degree, the treewidths of Gc and 
are asymptotically the same. 

Lemma 4.4. For any graph G of maximum degree A(G), 

(tw(G) - l)/2 < tw(G*) < A(G)(tw(G) + 1) - 1. 

Proof. From a tree decomposition T of G of width d we obtain a tree decomposition T * of G* of width 
{d+ \ ) • A(G) — 1 by replacing each vertex v G V{G) with all edges e incident to v. This guarantees that 
every edge of G* is in some bag, i.e. (Tl) is true. Item (T2) is true since if e\ and e2 are both incident to a 
vertex u in G, then any bag in T containing u contains both e\ and ^2 in T*. To verify Item (T3), suppose 
that e connects u and v in V{G). Take two bags a and b that both contain e. Then in T , both bags a and b 
must have either u or v. If they contain the same vertex, then a and b are connected, by (T3). Otherwise, 
there must be a bag c that contains both u and v, by (T2). So a and b are connected through c. Therefore we 
have proved that tw(G*) < A(G)(tw(G) + 1) - 1. 

Now to prove tw(G) < 2tw(G*) + 1, we start with a tree decomposition T* of G* of width d, and replace 
every e by its two end vertices in V{G). The verification of (Tl) through (T3) can be accomplished in a 
similar way. □ 

Note that the above bounds are asymptotically tight, since for an m-ary tree (of which each non-root 
internal vertex has degree m + 1), the treewidth is 1 and the contraction complexity is m. We summarize the 
above finding in the following theorem. 

Theorem 4.5. Let d >l be an integer. For any family of graphs Gn, n G N, such that A(G„) < d, for all n, 
then 

(tw(G„) - l)/2 < cc(G„) = tw(G*) < d{tw{Gn) + 1) - 1, Vn G N. 
We are now ready to put everything together to prove the following restatement of Theorem ll.il 
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Theorem 4.6. Let C be a quantum circuit of size T and with n input and m output qubits, x G {0, 1}" be an in- 
put, and X : [m] — > L(C^) be a measurement scenario. Denote by Gc the underlying circuit graph ofC. Then 
the probability that X is realized on C(p.v) can be computed deterministically in time T^^'' exp[C?(cc(Gc))] = 
r(i)exp[0(tw(Gc))]. 

Proof. The following algorithm computes the desired probability. 

(1) Construct iV = Af(C;x,x). 

(2) Apply the Robertson-Seymour algorithm to compute a tree decomposition T of A'^* of width w = 
0{tw{N*)) (Theorem |43]l. 

(3) Find a contraction ordering n from T (Proposition 14.21 ) of width w. 

(4) Contract A'^ using n, and output the desired probability from the final (rank-0) tensor (Proposition 13.51 ). 

The runtime bottlenecks are Steps ^ and both taking time T^^'^ exp(0[tw(A'^*)]), which by Theo- 
remOis r^(') exp[0(cc(Gc))] = T^f^' exp[0(tw(Gc))]. In fact, Steps 2 and 4 can be combined, but we 
separate them for the sake of clarity. □ 

5 Treewidth and quantum circuits 

In this section we prove the implications of Theorem 11.11 stated in the Introduction. A number of tight 
bounds for the treewidth of specific families of graphs have been published, including those for planar and 
series-pai^allel graphs. However, similar results for graphs derived from quantum circuits are lacking. To 
this end, we strengthen Corollary 1 1.5 1 as follows. 

Proposition 5.1. Let C be a quantum circuit in which each gate has an equal number of input and output 
qubits, and whose qubits are index by [n], for an integer n > I. Suppose that the size of C is T, and r is 
the minimum integer so that for any i, 1 < / < « — 1, no more than r gates act on some qubits j and f with 
j <i < f. Then C can be simulated deterministically in time T^^^'' exp[0(r)]. 

Corollary [T3] follows since r = 0{qD) under its assumption. 

Proof of Proposition 15. i I Assume without loss of generality that tw(Gc) > 2. Let G be the graph obtained 
from Gc by removing degree 1 vertices and contracting edges incident to degree 2 vertices. Then tw(G) = 
tw(Gc), by Proposition 12. II and the observation stated before it. Then each vertex in G con^esponds to a 
multi-qubit gate in C. 

We now construct a tree decomposition T for G that forms a path of ?i — 1 vertices Bi —Bj— \. 

The bag S; of the vertex (1 </<«— 1) consists of multi-qubit gates (vertices) that act on some qubits j 
and / with j < i < /. Hence |B; | < r by the assumption. If u acts on qubits /i , /2 , • • • ,ik, ' i < '2 < • • • < ik, 
then u G B,, for all /, /'i < / < 4. Thus (Tl) and (T3) are true. If a wire segment corresponding to the qubit 
/ connects two gates u and v, the bag B,- contains both u and v. Thus (T2) is true. Therefore Tis a tree 
decomposition for G with width r — 1. Hence tw(Gc) = tw(G) = 0{r), which by Theorem 11.11 implies that 
C can be simulated in r^^') exp[0(r)] time. □ 

We now turn to quantum circuits of bounded depth. To prove Theorem 1 1.7 1 we will make use of the fol- 
lowing observation that relates expander graphs to contraction complexity. Let <i be a constant and {Gn}neN 
be a family of cf-regular graphs, and £ > be a universal constant. Recall that {G„} is called a family of 
expander graphs with expansion parameter £ if, for any subset S C V(G„) with l^l < |V(G„)|/2, there are no 
less than e\S\ edges connecting vertices in S with vertices in V{G) — S. 
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Lemma 5.2. For an expander graph Gn with the expansion parameter £, cc(G„) > £|V(G„)|/4. 

Proof. Fix a contraction ordering of G,,- Let v be the first merged vertex so that ky, the number of vertices 
in V{G„) that were eventually merged to v, is at least |V(G„)|/4. Then < \V{G,t)\/2, and v must have 
degree £|V(G„) 1/4. □ 

The following graph is shown to be an expander by Lubotzky, Phillips and Sarnak |[T9l . Let > 2 be a 

prime, and G,j be the graph with V{Gp) '= ZpU {°°}, and every vertex x is connected to ;c+ 1, x — 1 and x^^ 
(oo lb 1 are defined to be oo). Note that Gp is a 3-regular graph. 

Proof of Theorem \1. 71 By Lemma \5?2\ cc(Gp) = Q.{p). Since Gp is a 3 regular graph, tw(Gp) = 
&{cc{Gp)) = 0.{p), by Theorem 14.51 Let G'p be the graph obtained from Gp by removing the vertex oo 
and the edge {0,/? — 1}. This would only decrease tw(Gp) by at most constant. Hence tw(G^) = 0.{p). 
Therefore to prove the theorem, it suffices to construct a quantum circuit C on p qubits so that G'p is a minor 
ofG*. 

Each qubit of C corresponds to a distinct vertex in V (G^). Observe that edges in E{G'p) can be partitioned 
into thi^ee vertex-disjoint subsets: (1) {x,x^'}; (2) {x,x+ 1} for even x, < x < p — 3; (3) the remaining 
edges. Each subset gives a layer of two-qubit gates in C. In G^, contracting all the vertices that correspond 
to the same qubit gives a graph of which G'p is a minor. Hence tw(C) = 0(tw(G^)) = 0.{p). □ 

Proof of Theorem \1.8\ By Theorem 14.51 it suffices to prove that cc (Gc) = 0(1) for any depth-2 circuit. 
Observe that for any such circuit, after contracting the input and output vertices (those are of degree 1, 
hence contracting them will not increase the contraction complexity), every vertex in Gc has degree either 
1 or 2. Hence the edges can be decomposed into disjoint paths and cycles, which can be contracted without 
increasing the degree. Hence cc (Gc) < 2. □ 



6 Simulating one-way quantum computation 

This section revisits the notions of graph states and one-way quantum computation. We first simulate one- 
way computation with an algorithm whose complexity grows exponentially with the contraction complexity 
of the underlying graph. We then reduce general one-way computation to the special case where the vertex 
degree is bounded by a constant. Since for such graphs the contraction complexity is the same as the 
treewidth (up to a constant), this reduction facilitates a more efficient simulation algorithm, as stated in 
Theorem 1 1.6 1 

Let G = iy,E) be a simple undirected graph with \V\ = n. For a subset V' C V , denote by eiy') the 
number of edges in the subgraph induced by V' . We associate a qubit with each vertex v G V, and refer to it 
by qubit v. For a subset V' C V , we identify the notation \V') with the computational basis |x), for x G {0, 1}" 
being the characteristic vector of V' (i.e., the bit of x is 1 if and only if the Z"* vertex under some fixed 
ordering is in V'). The graph state |G) is the following n-qubit quantum state ||7j 

Note that |G) can be created from |0") by first applying Hadamard gates to all qubits, followed by the 
Controlled-Phase gate A(a') = Lii,i2e{o,i}(~l)^' ''^I^i)^2)(^i5^2| on each pair of qubits u and v with 
{m,v} G E. Since all the A(a") operators commute, the order of applying them does not affect the result. 
A basic building block of our simulation algorithm is the following. 
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Lemma 6.1. Let G = iy,E) be a graph with n vertices, and z be a measuring scenario (defined in Defini- 
tion's}) on n qubits. Then the probability p that X is realized on \G) can be computed detenninistically in 
time 0(|V|^(^)exp[(9(cc(G))]). 

Proof. Fix a circuit Cg that creates |G) from |0)(0|®". Let {m,v} G E, and g = gu+ ,u- ,v+ .v- be a tensor in 
N{Cg',i) corresponding to A(a^)[M,v]. The wires representing the qubit u (or v) before and after the gate 
are labeled m+ (or v+) and u^ (or v^), respectively. We replace g by two tensors g" = g"^+ ^+ ^_ and 
g'' = ^+ which share two labels f+ and t^ and are defined as follows. For a wire segment with a 
label a, denote by L^, the 4-dimensional space of linear operators associated with this wire segment. Set 
to be the identity superoperator that maps L„+ ® L,- L,+ i^Lj,-, and g^' to be the tensor for a A(a') 
that maps Li+ (8'Lv+ — > L,- ^Ly-. By their definitions, contracting and g^' gives precisely g. We call the 
inserted wires labeled with f+ and t^ transition wires. See Figure|5]for an illustration. 



u 



u 



u 



t+ 

9^1 J 9 
t 



Figure 5: Replacing a tensor g corresponding to cJj.[m, v] by two tensors g" and g^. 

Denote by N'{Cg','^) the tensor network obtained from A^(Cg;x) by applying the above replacement pro- 
cedure for each edge in E. Let G' be the underlying graph of N'{Cg','z). Note that G' has the maximum 
degree 4 and the number of vertices is Od^j). See Figure[6]for an illustration. Thus p can be computed by 
contracting N'{Cg;'c) in time OdVl^^^^) exp[0(cc(G'))]), according to Theorem l46l 




Figure 6: For a graph G in (a), the tensor network N{Cg', x) is shown in (b). Input vertices are at the top, and 
output vertices are at the bottom. Each box is a tensor corresponding to a A(aj.) applied to qubits adjacent 
in G. In (c), each A(a-) tensor is replaced by two tensors and two wires connecting them, as described in 
Figure [51 Contracting all solid lines in (c) produces the graph in (d), which is precisely G with each edge 
doubled. 

We now prove that cc(G') = 0(cc(G)). This can be seen by contracting all wire segments corresponding 
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to the same qubit in G' , while leaving the transition wires untouched. Since contracting the edge incident 
to an input or output vertex results in a new vertex of degree 3, and contracting the rest of the wires for a 
qubit V results in a new vertex of degree 2d{v), the maximum degree of a merged vertex in this process is 
max{3,2A(G)}. The one-to-one con^espondence between the resulting vertex set and V induces naturally a 
one-to-one coiTcspondence between the pairs of transition wires and E. Thus a contraction ordering of G 
gives a contraction ordering of G' (of this stage) with at most twice of the contraction complexity. Therefore 

cc(G') < max{3,2A(G),2cc(G)} = 0(cc(G) + 1). 

Thus p can be computed deterministically in time 1*^(1) exp[0(cc(G'))]) = exp[0(cc(G))]). 

□ 

A one-way computation on a quantum state |(|)) consists of a sequence of adaptive single-qubit measure- 
ments and single-qubit unitary operations applied to |(|)). The description of each measurement or unitary 
operation, including the index of the qubit that it acts on, can be computed by a deterministic and efficient 
(polynomial time) algorithm from previous operations and their measurement outcomes. In our discussion 
we treat this computation time as a constant. We call a one-way quantum computation oblivious if before the 
last measurement (which produces the outcome of the computation), different computational paths involve 
the same number of measurements, take place with the same probability, and result in an identical state. 
Note that the one-way computation of Raussendorf and Briegel |[26i is oblivious. 

We point out that allowing single-qubit unitary operations in the definition is for the convenience of 
discussion only, since each single-qubit unitary can be combined with a future measurement on the same 
qubit (should there be one). To see this fact, let us call two quantum states LU -equivalent (where LU stands 
for Local Unitary), if there exists a set of single-qubit unitary operations applying which maps one state to the 
other. A one-way computation with unitary operators always has an almost identical one-way computation 
without unitary operations: the measurements are in one-to-one correspondence with identical outcome 
distributions, and the states after corresponding measurements ai^e LU-equivalent. Therefore, when we are 
only interested in the distribution of the measurement outcomes, we may assume without loss of generality 
that a one-way computation does not involve any unitary operation. 

We now derive a simulation algorithm whose complexity depends on the contraction complexity. 

Lemma 6.2. A one-way quantum computation on a graph G = (V,^) can be simulated by a randomized 
algorithm in time C?(|V|^'^^ exp[C?(cc(G)]). If the one-way computation is oblivious, the simulation can be 
made deterministic. 

Proof. Let T be the number of measurements during the one-way computation. Assume without loss of 
generality that no single-qubit unitary operation is applied. The simulation consists of T steps, one for each 
single-qubit measurement. It maintains a data structure r = (x,/?), where x is a measurement scenario, and p 
is the probability that x is realized on |G). Denote by = {Xt,pt) the value of r when t measurements have 
been simulated. Initially Xo(/) = / for all i, I <i <n, and po = 1- 

Suppose we have simulated the first t — \ measurements, \ <t <T — \. 

(1) Based on the one-way algorithm, compute from x,_i the description of the t''^ measurement = 
{if ,P}} and the qubit at that it acts on. Denote by xf the measurement scenario identical to X(_i, 
except that x['(a;f) =P°. 

(2) Compute pj*, the probability of realizing xj*. By Lemma [Ol this takes exp[0(cc(G))]) time. 
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(3) Flip a coin that produces with probability /pt-\, resulting in an outcome Z?, G {0, 1}. Set to be 
identical to x,_i, except that x{at) = p^' . Set = (1 — bt)p^ + b,{pt-\ — p^)- Continue the simulation 
until t = T. 

By construction, the output distribution is identical to that of the one-way computation. The complexity 
of the algorithm is |^(') exp[cc(G)]). 

If the one-way computation is oblivious, there is no need to adaptively simulate the first T — \ mea- 
surements, as all of them lead to the same state with the same probability pt-\- Let t,t-\ (Xr) be the 
measurement scenario corresponding to the first T — \ {T , respectively) measurements giving the outcome 
0. We compute the probabilities pt-\ and pj that Xj-x and Xj are realized. Then the probability that 
the one-way computation produces is precisely Pt/pt-\- The computation is deterministic and takes 
|V|^(^)exp[(9(cc(G))] time by LemmaO □ 

The main difference between the above lemma and Theorem 11.61 is that the simulating complexity of 
the former is exponential in cc(G), while that of the latter is exponential in tw(G). Since A(G) is not 
bounded in general, the lemma does not directly imply the theorem. We shall reduce a one-way computation 
on a graph state |G) to a one-way computation on another graph state |G'), such that A(G') = 0(1) and 
tw(G') = 0(tw(G)). Under this reduction, the exponent in the simulating complexity is on the order of 
cc(G') = 0(tw(G')) = 0(tw(G)). Such a reduction was found in ED. Let G and G' be two graphs. We call 
G' an expansion of G if G can be obtained from G' by contracting a set of edges that form a forest. 

Theorem 6.3 ( 11211 ). Any undirected simple graph G = (y,E) has an expansion G' = {V',E') such that 
A(G') < 3, \V'\ = 0{\E\ + \V\), and tw(G') < tw(G) + 1. Furthermore, such a graph G' can be computed 
deterministically from G in exp[0(tw(G))] time. 

In our application we need to insert a vertex into an edge that will be contracted during the transformation 
of the graph G' in the above theorem to G. This is to facihtate the application of the following fact about 
graph states, a proof for which is given in the Appendix. 

Proposition 6.4 ( II26I ). Let G be a graph obtained from a simple undirected graph G' by replacing a vertex 
u G V(G') with three vertices v, w, and v', such that w is adjacent to v and v' only, and each vertex adjacent to 
u in G' becomes adjacent to either v or v', but not both. Then \G') can be obtained from \G) by an oblivious 
one-way computation that makes 2 measurements. 

The use of expansion is illustrated in Figure |7l and summarized in the following Corollary. 

Corollary 6.5. Let G = (y,E) be a simple undirected graph. There exists a graph G\ = (Vi,^!) such that 
(a) A(Gi) < 3, (b) |Vi| = 0{\E\ + \V\), (c) tw(Gi) < tw(G) + 1, (d) Gi can be computed deterministically 
from G in time exp[0(tw(G))], and, (e) |G) can be obtained by an oblivious one-way computation 

on |Gi), 

Proof. Let G' = {V' ,E') be a graph satisfying the properties in Theorem l6.3[ Let E[ C E' be the set of edges 
contracting which would transform G' to G. For each e £ E[, insert a vertex at e (that is, disconnect the end 
vertices of e and connect them to the new vertex). 

We show that the resulting graph G\ satisfies the required properties. Note that by Proposition 12.11 
tw(G') = tw(Gi). Properties of G' implies that Properties (a-e) hold. The composition of the oblivious 
one-way computation in Proposition 16.41 applied to the inserted vertices transforms |Gi) to |G), and is itself 
oblivious. □ 

We are now able to prove this section's main theorem, which restates Theorem 1 1.61 and extends it to the 
case of oblivious one-way computation. 
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Figure 7: To a graph with high-degree vertices in (a) we apply the construction from |I2T1| to produce a 
small-degree expansion in (b) that preserves treewidth. The graph in (c) is obtained from (b) by inserting a 
vertex at each edge. The corresponding graph state can lead to the graph state of (a) through an oblivious 
one-way computation. The graph in (d) illustrates that not every expansion of (a) preserves treewidth ||2TI . 



Theorem 6.6. Let G = {V,E) be a simple undirected graph. Then a one-way computation on G can be sim- 
ulated by a randomized algorithm in time | exp[C?(tw(G))]. The simulation can be made deterministic 
if the one-way computation is oblivious. 

Proof. Let G\ = {y\,E\) be a graph satisfying the properties stated in Corollarv 16.51 Thus \G) can be 
obtained from through an oblivious one-way computation. Therefore, the given one-way computation 
P on\G) can be carried out by a one-way computation P' on\G\) which first produces |G) then continues 
executing P. Note that P' is oblivious if P is. By Lemma 16.21 P' can be simulated by a randomized, or 
deterministic if P is obhvious, algorithm in time O ( | Vi | ^ ) exp [O (cc (G i ) )] ) . Note that cc (Gi ) = O (tw (G i ) ) , 
by Lemma 1131 since A(Gi) < 3. Thus cc(Gi) = 0(tw(G)), since tw(Gi) < tw(G) + 1. Since |Vi| = 
0{\V\ + \E\) = the simulation time complexity is exp[C>(tw(G))]). □ 



7 Discussion 

In this work we studied quantum circuits regardless of the types of gates they use, but with a focus on how 
the gates are connected. We have shown that quantum circuits that look too similar to trees do not offer 
significant advantage over classical computation. More generally, when solving a difficult classical problem 
on a quantum computer, one encounters an inherent trade-off between the treewidth and the size of quantum 
circuits for solving it — the smaller the quantum circuit, the more topologically sophisticated it must be. 
Investigating such trade-offs for specific problems of interest is an entirely open and very attractive avenue 
for future research. Similar considerations may apply to classical circuits. We conjecture that there are 
simple functions, such as modular exponentiation, whose circuit realizations require large treewidth. 

Furthermore, our work raises an intriguing possibility that the treewidth of some quantum circuits may be 
systematically reduced by restructuring the circuit, while preserving the final result of the entire computa- 
tion. Perhaps, future research in this direction can clarify the limits to efficient quantum computation, while 
the tools developed in this context will be useful for practical tasks. 
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The pre-print of this paper ll20l has lead to several follow-up results. Jozsa |[T6l and Aharonov et al. ||3l 
gave alternative proofs for some of our theorems. Furthermore, Aharonov et al. lO, and Yoran and Short POl 
pointed out that Quantum Fourier Transform (QFT) over Z„ admits approximate circuit realizations that, 
viewed as tensor networks, have small treewidth. Given the central role of QFT in known quantum al- 
gorithms, their results are somewhat unexpected and their implications are yet to be fully explored. For 
example, what type of circuits would remain efficiently simulatable when interleaved with QFT circuits? 
In general, as implied by Theorem 11.71 and 11.81 the treewidth of a circuit may increase dramatically under 
composition. Yoran and Short BOl have shown that this drawback may be avoided in some cases. Extending 
their result would deepen our understanding of quantum speed-ups. 

The important question of characterizing quantum states that are universal (or efficiently simulatable) for 
one-way quantum computation remains unsolved. In another follow-up thread, van den Nest et al. ll23l l22l 
defined additional width-based parameters of quantum states and demonstrated results for those parameters 
similar to Theorem 11.61 It is unlikely that the set of quantum states with small width-based parameters 
includes all efficiently simulatable states because a set of simulatable states of high widths was identified 
recently by Bravyi and Raussendorf (91. Nevertheless, it remains plausible that those width-based results 
and their further extensions may be part of a classification theorem that gives a complete characterization of 
efficiently simulatable states. 
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A Proof of Proposition 



2.1 



Recall that a minor of a graph G is a graph obtained from a subgraph of G by contracting edges. A basic 
property of treewidth is that it does not increase under taking minors |[27l . 

Proof of Proposition \2.1\ Let G' be the graph resulting from the contractions. Since G' is a minor of G, 
tw(G') < tw(G) ( |[27l ). If tw(G') = 1, then G' is a non-empty forest (otherwise G has a triangle minor, 
thus tw(G) > 2). Thus G is also a non-empty forest and tw(G) = 1 = tw(G'). Suppose tw(G') > 2. Let 
T be a tree decomposition for G'. We obtain a tree decomposition T' for G by inserting a bag containing 
{m,>v,v}, and connecting it to a bag that contains {m,v}. One can verify directly that the three conditions 
{Ji — T3) that define tree decompositions hold for T'. Since the width of T' is no more than that of T, we 
have tw(G) < tw(G'). Therefore, tw(G) = tw(G'). □ 



B Proof of Proposition \6A 



Proof of Proposition \6.4\ Denote by G the subgraph of G' induced by V(G') — {«}. Let A and A' be vertices 
in V(G') — {u} that are adjacent to v and V , respectively, in G. Note that A nA' = 0, thus A ©A' = A UA' is 
the neighborhood of u in G' . Also, v A' and v' A. 

Starting with |G), we first measure on w. If the outcome is +1, the resulting state is 

= |00)v'v|G) + |ll)v'va,[A©A']|G). (2) 

Otherwise, the resulting state is 

|01),va.[A]|G) + |10),.„a,[A']|G), 

which can be brought to |(|)i) by a;c[v]aj[A]. We then measure Ox[v'] on \^\). If the outcome is +1, then the 
resulting state is precisely |G'). Otherwise it is 

|0)„|G)-|l)a,[A©A']|G), 

which can be brought to |G') by Oz[v\. The four outcomes of the two measurements have equal probability 
(1 /4). Thus the one-way computation is oblivious. □ 
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